Neural learning for articulatory speech synthesis under different statistical characteristics of acoustic input patterns

نویسندگان

  • H. Altun
  • K. Mervyn Curtis
  • T. Yalcinoz
چکیده

Input data representation is highly decisive in neural learning in terms of convergence. In this paper, within an analytical and statistical framework, the effect of the distribution characteristics of the input pattern vectors on the performance of the back-propagation (BP) algorithm is established for a function approximation problem, where parameters of an articulatory speech synthesizer are estimated from acoustic input data. The aim is to determine the optimum statistical characteristics of the acoustic input patterns in order to improve neural learning. Improvement is obtained through a modification of the statistical characteristics of the input data, which reduces effectively the occurrence of node saturation in the hidden layer. 2002 Elsevier Science Ltd. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Acoustic-to-articulatory Neural Mapping under Different Statistical Characteristics of Articulatory Pattern Vectors

This paper describes a mapping problem that tests and validates the findings from our analytical analysis of neural learning[1]. In this analysis different statistical characteristics of the target pattern vectors were investigated as to their effect on learning and generalisation. The problem reported on is a difficult function approximation problem, where the parameters of an articulatory spe...

متن کامل

The Accurate Estimation of Articulatory Synthesiser Parameters through Reducing the Degree of Saturation

A new method is proposed to correctly estimate the parameters of an articulatory speech synthesiser using a MLP neural network. This is achieved through modifying the statistical characteristic of the acoustic input pattern vectors in order to prevent the activation level of the hidden nodes from approaching saturation. The technique results in considerably faster neural learning and a more acc...

متن کامل

Integrating Articulatory Information in Deep Learning-Based Text-to-Speech Synthesis

Articulatory information has been shown to be effective in improving the performance of hidden Markov model (HMM)based text-to-speech (TTS) synthesis. Recently, deep learningbased TTS has outperformed HMM-based approaches. However, articulatory information has rarely been integrated in deep learning-based TTS. This paper investigated the effectiveness of integrating articulatory movement data t...

متن کامل

Visual synthesis of source acoustic speech through kohonen neural networks

The objective of bimodal (audio-video) synthesis of acoustic speech has been addressed through the use of Kohonen neural architectures encharged of associating acoustic input parameters (cepstrum coe cients) to articulatory estimates. This association is done in real-time allowing the synchronized presentation of source acoustic speech together with coherent articulatory visualization. Di erent...

متن کامل

Data driven articulatory synthesis with deep neural networks

The conventional approach for data-driven articulatory synthesis consists of modeling the joint acoustic-articulatory distribution with a Gaussian mixture model (GMM), followed by a post-processing step that optimizes the resulting acoustic trajectories. This final step can significantly improve the accuracy of the GMM frame-by-frame mapping but is computationally intensive and requires that th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computers & Electrical Engineering

دوره 29  شماره 

صفحات  -

تاریخ انتشار 2003